This is a panic download of occupywallst.org as of 2012-08-22. I also grabbed all images that are linked on occupywallst.org. Images from i.imgur.com, imgur.com and 2439-occupywallst-com.voxcdn.com are captured in separate warc.gz and .tar.gz archives.
NOTE1: The download started very late on 2012-08-22 and continued into 2012-08-23.
NOTE2: 2439-occupywallst-com.voxcdn.com is the same as occupywallst.org.
August 28, 2012
Subject: Question about .warc files
How can we extract the content of .warc files?
Is there software or a utility for this?
I looked on Google, but it's very complicated: you have to install servers, Java and other stuff, and all of that is only needed if you want to BROWSE the archived website.
But I only want to extract some of the files, not all the HTML pages or any other meta information ...
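
For pulling out only some of the files without installing a server, one possible approach is a short Python script that uses nothing beyond the standard library. The sketch below is not an official tool. It assumes the archive is a gzip-compressed WARC 1.0 file in which each record starts with a `WARC/` version line, carries a `Content-Length` header, and stores the raw HTTP response as the record block. The function names (`iter_warc_records`, `extract_responses`, `extract_from_file`) are made up for this illustration, and the script does not undo chunked transfer encoding or gzip content encoding inside the HTTP response, so some bodies may need further decoding.

```python
import gzip


def iter_warc_records(stream):
    """Yield (headers, block) for each record in an uncompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if not line.startswith(b"WARC/"):
            continue  # skip the blank lines that separate records
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # a blank line ends the WARC header section
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        length = int(headers.get("content-length", "0"))
        yield headers, stream.read(length)


def extract_responses(stream, url_suffixes):
    """Return {url: body} for response records whose URL ends with a suffix."""
    found = {}
    for headers, block in iter_warc_records(stream):
        if headers.get("warc-type") != "response":
            continue  # ignore request, metadata and warcinfo records
        url = headers.get("warc-target-uri", "").strip("<>")
        if not url.endswith(tuple(url_suffixes)):
            continue
        # The block is a raw HTTP response; the body follows the first blank line.
        _, sep, body = block.partition(b"\r\n\r\n")
        if sep:
            found[url] = body
    return found


def extract_from_file(path, url_suffixes):
    """Open a .warc.gz file (hypothetical path) and extract matching bodies."""
    with gzip.open(path, "rb") as stream:
        return extract_responses(stream, url_suffixes)
```

Calling `extract_from_file("some-archive.warc.gz", (".png", ".jpg"))`, with the file name replaced by a real archive, would return a dictionary that maps each matching URL to the bytes of that file, which can then be written to disk under whatever names you like. Selecting by URL suffix is only one choice; filtering on the `Content-Type` header of the HTTP response would work too.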